Markov decision process

Results: 537



#Item
481 Policy-contingent abstraction for robust robot control  Joelle Pineau, Geoff Gordon and Sebastian Thrun School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213

Source URL: www.cs.cmu.edu

Language: English - Date: 2003-06-04 12:29:33
482 Applying Metric-Trees to Belief-Point POMDPs  Joelle Pineau, Geoffrey Gordon School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213

Source URL: www.cs.cmu.edu

Language: English - Date: 2004-01-17 12:21:11
483 Distributed Planning in Hierarchical Factored MDPs  Carlos Guestrin Computer Science Dept Stanford University [removed]

Source URL: www.cs.cmu.edu

Language: English - Date: 2003-06-04 17:02:40
484 EXECUTION-TIME COMMUNICATION DECISIONS FOR COORDINATION OF MULTI-AGENT TEAMS Maayan Roth CMU-RI-TR-08-04

Source URL: www.ri.cmu.edu

Language: English - Date: 2012-08-21 12:43:37
485 Applying Reinforcement Learning to Obstacle Avoidance  Josh Beitelspacher University of Oklahoma, 308 Cate Center Drive Box 5242, Norman, OK [removed] USA [removed]

Source URL: www.netbeetle.com

Language: English - Date: 2011-01-10 03:02:38
486 Fast approximate planning in POMDPs Geoff Gordon [removed] Joelle Pineau, Geoff Gordon, Sebastian Thrun. Point-based

Source URL: www.cs.cmu.edu

Language: English - Date: 2003-05-01 11:32:55
487 Tree Based Hierarchical Reinforcement Learning William T. B. Uther August 2002 CMU-CS[removed]

Source URL: reports-archive.adm.cs.cmu.edu

Language: English - Date: 2003-03-11 11:32:52
488 Policy Gradient vs. Value Function Approximation: A Reinforcement Learning Shootout Technical Report No. CS-TR[removed] February 2006 Josh Beitelspacher, Jason Fager, Greg Henriques, and Amy McGovern School of Computer Sci

Source URL: www.netbeetle.com

Language: English - Date: 2011-01-10 03:02:38
489 Algorithms for Inverse Reinforcement Learning Andrew Y. Ng, Stuart Russell ang@cs.berkeley.edu russell@cs.berkeley.edu

Source URL: www.cs.berkeley.edu

Language: English - Date: 2007-04-04 16:42:02
490 Trial-based Heuristic Tree Search for Finite Horizon MDPs

Source URL: www2.informatik.uni-freiburg.de

Language: English - Date: 2013-04-05 08:41:03